Abstract

In this paper the quantum version of the source coding theorem is obtained for a
completely ergodic source. This result extends Schumacher's quantum noiseless coding
theorem for memoryless sources. The control of the memory effects requires some earlier
results of Hiai and Petz on high probability subspaces. Our result can equivalently be
considered as a compression theorem for noiseless stationary channels.

I. Introduction

Although it is difficult to define a discipline, to give some idea we can say that the objective 
of quantum information theory is the transmission and manipulation of information stored in 
systems obeying quantum mechanics. A quantum channel has a source that emits systems in
quantum states to the channel. For example, the source could be a laser that emits individual
monochromatic photons and the channel could be an optical fiber. The noisy signal output of
the channel arrives at the receiver. In principle, there are two very different problems about
quantum channels. The sender has a quantum system in an unknown state and wants the
receiver to end up with a similar system in the same state. In this case we speak of
a pure quantum channel which has a quantum mechanical input and output. On the other
hand, one might want to use quantum states to carry classical information, roughly speaking
a sequence of zeros and ones. Now both the input and the output are classical; however,
there is a quantum mechanical section in between. The classical information is encoded into a
quantum state and this is sent down the channel. The higher the channel noise is, the more
redundant the encoding must be in order to restore the original signal at the receiver, where
the quantum signal is converted into classical information. In this paper we do not deal with
the problem of how such a scheme can be realistically implemented; practical quantum encoding
and decoding requires a sophisticated ability to manipulate quantum states. However, we
are interested in the amount of classical information getting through the channel which is 
assumed to be noiseless. It was emphasized already by Shannon that a computer memory is 
a communication channel. (Quantum or classical depends on the type of the computer.) In 
an optimal situation the computer memory is free of any noise and this is the case we are 
concentrating on in the present paper. We want to consider rather general noiseless quantum
channels (possibly with memory effects but with strong ergodic properties) and our aim is to discuss
the quantum source coding theorem. As a general reference on quantum information theory
we suggest the recent book [?], but the really necessary definitions are given below.

To each classical input message $x_i$ there corresponds a signal state $\varphi_i$ of the quantum
communication system. The quantum states $\varphi_i$ function as codewords of the messages.
The signal states $\varphi_i$ could be pure and orthogonal in the sense of quantum mechanics, but
for example in quantum cryptography nonorthogonal states are used intentionally in order to
avoid eavesdropping. At the moment we do not impose any condition on the signal states;
they could be arbitrary pure or mixed states. In the stochastic model of communication, one
assumes that each input message $x_i$ appears with a certain probability. Let $p_{ji}$ be the probability
that the message $x_i$ is sent and $y_j$ is received. The joint distribution $p_{ji}$ yields marginal
probability distributions $p_i$ and $q_j$ on the set of input and output messages. According to
Shannon the mutual information

$$I = \sum_{i,j} p_{ji} \log \frac{p_{ji}}{p_i q_j}$$

measures the amount of information going through the channel from Alice to Bob. Of course,
the relation of $I$ to the quantum encoding and decoding should be made clear. This comes
next.
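
As a small numerical illustration of the mutual information defined above, the following sketch evaluates $I$ for a joint distribution stored as an array. The function name `mutual_information` and the two sample distributions are illustrative assumptions, not part of the paper; natural logarithms are used.

```python
import numpy as np

def mutual_information(joint):
    """Shannon mutual information (in nats) of a joint distribution.

    joint[j, i] is the probability that message x_i is sent and y_j is received.
    """
    joint = np.asarray(joint, dtype=float)
    q = joint.sum(axis=1, keepdims=True)   # output marginal q_j, shape (J, 1)
    p = joint.sum(axis=0, keepdims=True)   # input marginal p_i, shape (1, I)
    mask = joint > 0                       # convention: 0 log 0 = 0
    return float(np.sum(joint[mask] * np.log(joint[mask] / (q @ p)[mask])))

# Hypothetical examples: a perfectly correlated pair and an independent pair.
noiseless = np.array([[0.5, 0.0], [0.0, 0.5]])
independent = np.array([[0.25, 0.25], [0.25, 0.25]])

info_noiseless = mutual_information(noiseless)      # equals log 2 for one faithful bit
info_independent = mutual_information(independent)  # vanishes when input and output are independent
```

For the perfectly correlated distribution the value is $\log 2$, one bit expressed in nats; for the independent one it is zero.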

The message $x_i$ has a priori probability $p_i$ and the mixed quantum state of the channel is

$$\varphi = \sum_i p_i \varphi_i .$$

This might be considered as the statistical operator of the message ensemble; for example,
when $\varphi_i$ is a pure state $|\Phi_i\rangle\langle\Phi_i|$, then $\varphi = \sum_i p_i |\Phi_i\rangle\langle\Phi_i|$ acts on the input Hilbert space $\mathcal{H}$. The
distribution of the output is determined by a measurement, which is nothing else but a physical
word for decoding. To each output message $y_j$ there corresponds an observable $A_j$ on the output
Hilbert space $\mathcal{K}$. It is customary to assume that $0 \le A_j$, $\sum_j A_j = \mathrm{id}$ (id stands for the identity
operator) and $p_{ji} = p_i \varphi_i(A_j)$. The so-called Kholevo bound [?] provides an upper bound on
the amount of information accessible to Bob in terms of von Neumann entropies:

$$I \le S(\varphi) - \sum_i p_i S(\varphi_i) .$$

(When $\lambda_1, \lambda_2, \dots$ are the eigenvalues of the statistical operator of a quantum state $\psi$, then
$S(\psi) = -\sum_k \lambda_k \log \lambda_k$.) In particular, if all signal states $\varphi_i$ are pure, then $S(\varphi_i) = 0$ and
we have $I \le S(\varphi)$. In this way the von Neumann entropy gets an information theoretical
interpretation. Kholevo's bound is actually not very strong; it is attained only in trivial
situations [?].

The basic problem of communication theory is to maximize the amount of information
received by Bob from Alice. However, up to now this problem is not yet well-posed in our
discussion. Let us deal with messages of length $n$; they are $n$-term sequences of 0 and
1. (So the size of this message set is $2^n$.) For each message length $n$ we carry out the above
procedure of coding and decoding, and the amount of information going through the channel
is $I_n$. Since $I_n$ is presumably proportional to $n$, the good information quantity is $I_n/n$, that
is, the transmitted information per letter. Since Shannon's theory is not only stochastic but
asymptotic as well, we are going to let $n \to \infty$. In this way we need to repeat the above
information transmission scheme for each $n$. The message set, the input Hilbert space $\mathcal{H}^{(n)}$,
our coding, the channel state $\varphi^{(n)}$, the output Hilbert space $\mathcal{K}^{(n)}$ and the observables applied
in the measurement all depend on the parameter $n$.

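
The Kholevo bound discussed above can be checked numerically on a toy ensemble. The sketch below is only an illustration under stated assumptions: the two nonorthogonal qubit signal states, the projective measurement, and the helper names `entropy`, `holevo_quantity` and `measured_information` are hypothetical choices, not taken from the paper.

```python
import numpy as np

def entropy(density):
    """von Neumann entropy -Tr D log D in nats."""
    eigenvalues = np.linalg.eigvalsh(density)
    eigenvalues = eigenvalues[eigenvalues > 1e-12]   # convention: 0 log 0 = 0
    return float(-np.sum(eigenvalues * np.log(eigenvalues)))

def holevo_quantity(probs, states):
    """Right-hand side of the Kholevo bound: S(phi) - sum_i p_i S(phi_i)."""
    average = sum(p * d for p, d in zip(probs, states))
    return entropy(average) - sum(p * entropy(d) for p, d in zip(probs, states))

def measured_information(probs, states, observables):
    """Mutual information of p_ji = p_i Tr(D_i A_j) for one fixed measurement."""
    joint = np.array([[p * np.trace(d @ a).real for p, d in zip(probs, states)]
                      for a in observables])
    q = joint.sum(axis=1, keepdims=True)
    p = joint.sum(axis=0, keepdims=True)
    mask = joint > 1e-15
    return float(np.sum(joint[mask] * np.log(joint[mask] / (q @ p)[mask])))

# Hypothetical ensemble: the nonorthogonal pure states |0> and |+>, equally likely.
ket0 = np.array([1.0, 0.0])
ket_plus = np.array([1.0, 1.0]) / np.sqrt(2)
states = [np.outer(ket0, ket0), np.outer(ket_plus, ket_plus)]
probs = [0.5, 0.5]
observables = [np.diag([1.0, 0.0]), np.diag([0.0, 1.0])]   # A_1 + A_2 = id

chi = holevo_quantity(probs, states)
info = measured_information(probs, states, observables)
```

Here `info` is the information obtained with one particular measurement, so it is a lower estimate of what Bob can access; the bound guarantees `info <= chi` for any choice of observables.
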
The subject of the present paper is faithful signal transmission, which bears the name
noiseless channel. In place of faithful transmission, one can think of information storage. In
this case the aim is to use the least possible number of Hilbert space dimensions per signal
for coding. The new feature of the noiseless channel we are studying is the memory effect.
Mathematically this means that the channel state (of the n-fold channel) is not of product type,
but we assume stationarity and good ergodic properties. In Section 2 we use the standard
formalism of statistical mechanics to describe such a channel. It turns out that the mean
von Neumann entropy, familiar also from statistical mechanics, gives the optimal coding rate.
The proof of our main result, Theorems 1 and 2, is similar to the proof presented in [?] for
Schumacher's coding theorem; however, instead of typical sequences we use the high probability
subspace of strongly ergodic stationary states, a subject studied by Hiai and Petz in [?]. We
note for the interested reader that most of the concepts used in the present paper are treated
in detail in the monograph [?].

II. An infinite system setting of the source 

If $\mathcal{H}$ is a finite dimensional Hilbert space then $(A, B) \mapsto \mathrm{Tr}(A^*B)$ defines an inner product
on $B(\mathcal{H})$, so for every linear functional $\varphi$ on $B(\mathcal{H})$ there exists a unique $D_\varphi \in B(\mathcal{H})$ with the
property $\varphi(A) = \mathrm{Tr}(D_\varphi A)$. When $\varphi$ is a state then $D_\varphi$ is the corresponding density matrix.
Let $X^n$ denote the set of all messages of length $n$. If $x^n \in X^n$ is a message then a quantum
state $\varphi(x^n)$ of the n-fold quantum system corresponds to it. The Hilbert space of the
n-fold system is the n-fold tensor product $\mathcal{H}^{\otimes n}$ and $\varphi(x^n)$ has a statistical operator $D(x^n)$.
If messages of length $n$ are to be transmitted then our quantum source should be put in the
state $\varphi_n = \sum_{x^n} p(x^n)\varphi(x^n)$ with statistical operator $D_n = \sum_{x^n} p(x^n)D(x^n)$, where $p(x^n)$ is
the probability of the message $x^n$. Since we want to let $n \to \infty$, it is reasonable to view all
the n-fold systems as subsystems of an infinite one. In this way we can conveniently use a
formalism standard in statistical physics, see Chap. 15 of [?].

Let an infinitely extended system be considered over the lattice $\mathbb{Z}$ of integers. The observables
confined to a lattice site $k \in \mathbb{Z}$ form the self-adjoint part of a finite dimensional matrix
algebra $\mathcal{A}_k$, that is, the set of all operators acting on the finite dimensional space $\mathcal{H}$. It is
assumed that the local observables in any finite subset $\Lambda \subset \mathbb{Z}$ are those of the finite quantum
system

$$\mathcal{A}_\Lambda = \bigotimes_{k \in \Lambda} \mathcal{A}_k .$$

The quasilocal algebra $\mathcal{A}$ is the norm completion of the normed algebra $\mathcal{A}_\infty = \bigcup_\Lambda \mathcal{A}_\Lambda$, the
union of all local algebras $\mathcal{A}_\Lambda$ associated with finite intervals $\Lambda \subset \mathbb{Z}$.

A state $\varphi$ of the infinite system is a positive normalized functional $\mathcal{A} \to \mathbb{C}$. It does not
make sense to associate a statistical operator to a state of the infinite system in general.
However, $\varphi$ restricted to a finite dimensional local algebra $\mathcal{A}_\Lambda$ admits a density matrix $D_\Lambda$.
We regard the algebra $\mathcal{A}_{[1,n]}$ as the set of all operators acting on the n-fold tensor product
space $\mathcal{H}^{\otimes n}$. Moreover, we assume that the density $D_n$ from the first part of this section is
identical with $D_{[1,n]}$. Under these assumptions we call the state $\varphi$ the state of the (infinite)
channel. Roughly speaking, all the states used in the transmission of messages of length $n$
are marginals of this $\varphi$. Coding, transmission and decoding could be well formulated using
the states $\varphi_n = \varphi_{[1,n]}$. However, it is more convenient to formulate our setting in the form of
an infinite system, particularly because we do not want to assume that the channel state $\varphi$ is
of product type. This corresponds to the possibility that our quantum source has a memory
effect.

The right shift on the set $\mathbb{Z}$ induces a transformation $\gamma$ on $\mathcal{A}$. A state $\varphi$ is called stationary
if $\varphi \circ \gamma = \varphi$. The state $\varphi$ is called ergodic if it is an extremal point in the set of stationary
states. Moreover, $\varphi$ is completely ergodic when it is an extreme point for every $m \in \mathbb{N}$ in the
convex set of all states $\psi$ such that $\psi \circ \gamma^m = \psi$. By a completely ergodic stationary quantum
source we simply mean a completely ergodic stationary state $\varphi$ of the infinite system $\mathcal{A}$.
Of course, a stationary product state, corresponding to a memoryless channel, is completely
ergodic. The emphasis here is put on other states.

Below we show an example of a completely ergodic stationary quantum source from the 
context of algebraic states. For the details see the original paper [?].



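Example 1. Let $\mathcal{A} := M_3(\mathbb{C})$, $\mathcal{B} := M_2(\mathbb{C})$, moreover let $\{E_{ij}\}_{i,j=1}^{3}$ be the usual matrix units
of $M_3(\mathbb{C})$. Set

$$V_i := \ldots \quad [\text{matrix entries not recoverable}]$$

Let $\rho$ be a state on $\mathcal{B}$ with density matrix

$$\ldots \quad [\text{matrix entries not recoverable}]$$

Define $\mathcal{E} : \mathcal{A} \otimes \mathcal{B} \to \mathcal{B}$ by $\mathcal{E}(E_{ij} \otimes x) := V_i^* x V_j$. It is easy to check that $\mathcal{E}$ is a completely
positive unital map and $\rho(\mathcal{E}(I \otimes x)) = \rho(x)$, $x \in \mathcal{B}$.

Then the algebraic state $\varphi$ generated by $(\mathcal{B}, \mathcal{E}, \rho)$ is given by

$$\varphi(E_{i_1 j_1} \otimes \dots \otimes E_{i_n j_n}) = \rho(V_{i_1}^* \dots V_{i_n}^* V_{j_n} \dots V_{j_1}) .$$

It is shown in [?] that $\varphi$ is completely ergodic. Of course, it is not a product state.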
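
To make the construction of an algebraic state concrete, here is a minimal numerical sketch. The operators `V` and the density `RHO` below are hypothetical stand-ins chosen only so that $\sum_i V_i^* V_i = 1$ and $\rho(\mathcal{E}(I \otimes x)) = \rho(x)$ hold; they are not the matrices of Example 1, and no claim is made that the resulting state is completely ergodic. The local density $D_n$ is assembled from the formula $\varphi(E_{i_1 j_1} \otimes \dots \otimes E_{i_n j_n}) = \rho(V_{i_1}^* \dots V_{i_n}^* V_{j_n} \dots V_{j_1})$.

```python
import itertools
import numpy as np

# Hypothetical operators (NOT those of Example 1) with sum_i V_i^* V_i = identity.
V = [
    np.diag([np.sqrt(0.5), 0.5]),
    np.array([[0.0, np.sqrt(0.75)], [0.0, 0.0]]),
    np.array([[0.0, 0.0], [np.sqrt(0.5), 0.0]]),
]
# Hypothetical invariant density: sum_i V_i RHO V_i^* = RHO.
RHO = np.diag([0.6, 0.4])

def local_density(n):
    """Density matrix D_n of the algebraic state on n sites, built as a Gram matrix."""
    sqrt_rho = np.sqrt(RHO)   # RHO is diagonal, so the entrywise root is the matrix root
    rows = []
    for indices in itertools.product(range(len(V)), repeat=n):
        w = np.eye(2)
        for i in indices:
            w = V[i] @ w      # builds W = V_{i_n} ... V_{i_1}
        rows.append((w @ sqrt_rho).ravel())
    m = np.array(rows)
    return m @ m.conj().T     # entry [j, i] equals rho(W_i^* W_j)

def entropy(density):
    """von Neumann entropy in nats."""
    eigenvalues = np.linalg.eigvalsh(density)
    eigenvalues = eigenvalues[eigenvalues > 1e-12]
    return float(-np.sum(eigenvalues * np.log(eigenvalues)))

d1, d2 = local_density(1), local_density(2)
two_site = d2.reshape(3, 3, 3, 3)                 # indices (j1, j2, i1, i2)
left_marginal = np.trace(two_site, axis1=1, axis2=3)    # trace out site 2
right_marginal = np.trace(two_site, axis1=0, axis2=2)   # trace out site 1
entropy_per_site = [entropy(local_density(n)) / n for n in (1, 2, 3, 4)]
```

Both one-site marginals of `d2` coincide with `d1`, which reflects stationarity, while `d2` differs from `np.kron(d1, d1)`, so this toy state carries memory.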



It is well-known in quantum statistical mechanics that due to the subadditivity of the von
Neumann entropy (proven first in [?] by Lieb and Ruskai) the limit

$$\lim_{n \to +\infty} \frac{1}{n} S(\varphi_n) = \inf_n \frac{1}{n} S(\varphi_n) =: h$$

exists for any stationary state and this quantity is called the mean entropy of $\varphi$. (See [?] for
a textbook treatment of the subject or for some related properties of the mean entropy.)



III. Source coding 

For a while we fix a message length $n$ and we denote by $d$ the dimension of the Hilbert space $\mathcal{H}$.
Assume that our n-fold composite quantum system is operating as a quantum source and emits
the quantum states $D^{(1)}, D^{(2)}, \dots, D^{(m)}$ with a priori probabilities $p_1, p_2, \dots, p_m$. (Therefore
the state of the system is $D_n = \sum_i p_i D^{(i)}$.) By source coding we mean an association

$$D^{(i)} \mapsto \tilde D^{(i)},$$

where $\tilde D^{(i)}$ is some other statistical operator on the Hilbert space $\mathcal{H}^{\otimes n}$. (This definition allows
$\tilde D^{(i)} = \tilde D^{(j)}$ but $D^{(i)} \ne D^{(j)}$; however, in the coding constructed in the proof of Theorem 1 this
cannot happen.)






We denote by $\mathcal{K}_n$ the subspace spanned by the eigenvectors corresponding to all nonzero
eigenvalues of all statistical operators $\tilde D^{(i)}$, $1 \le i \le m$. The goal of source coding is to keep
the dimension of $\mathcal{K}_n$ small and to fulfil some fidelity criterion. (A mathematically
demanding survey about quantum coding is the paper [10].) The source coding rate

$$\limsup_{n \to \infty} \frac{\log \dim(\mathcal{K}_n)}{n}$$

expresses the resolution of the encoder in qubits per input symbol. (It is actually more precise
to speak about "qunats" per input symbol, but the difference is only a constant factor.)

The distortion measure is a number which allows us to compare the goodness or badness
of communication systems. The fidelity of the coding scheme was introduced by Schumacher
[?]:

$$F := \sum_i p_i \mathrm{Tr}\, D^{(i)} \tilde D^{(i)},$$

where $p_i$ is a probability distribution on the input and $\tilde D^{(i)}$ is the density used to encode the
density $D^{(i)}$. Note that $0 \le F \le 1$ and $F = 1$ if and only if $\tilde D^{(i)} = D^{(i)}$ are pure states.
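
The fidelity formula above is easy to evaluate directly. The following sketch, with a hypothetical helper `schumacher_fidelity` and two hand-picked qubit ensembles, illustrates the remark that $F = 1$ requires pure states: a mixed state encoded by itself already gives $F < 1$.

```python
import numpy as np

def schumacher_fidelity(probs, sources, codes):
    """F = sum_i p_i Tr(D_i Dtilde_i) for source densities D_i and coding densities Dtilde_i."""
    return float(sum(p * np.trace(d @ c).real for p, d, c in zip(probs, sources, codes)))

probs = [0.5, 0.5]
# Hypothetical ensembles on a single qubit.
pure = [np.diag([1.0, 0.0]), np.diag([0.0, 1.0])]    # orthogonal pure states
mixed = [np.diag([0.5, 0.5]), np.diag([0.5, 0.5])]   # maximally mixed states

f_pure = schumacher_fidelity(probs, pure, pure)      # identity coding of pure states
f_mixed = schumacher_fidelity(probs, mixed, mixed)   # identity coding of mixed states
```

With identity coding the pure ensemble reaches $F = 1$, while the maximally mixed one gives $F = \mathrm{Tr}\, D^2 = 1/2$.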

First we present our positive source coding theorem for a completely ergodic source. The 
result says that the source coding rate may approach the mean entropy while we can keep the 
fidelity arbitrarily good. 

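Theorem 1. Let $\mathcal{H}$ be a finite dimensional Hilbert space, and $\varphi$ be a completely ergodic state
on $B(\mathcal{H})^{\otimes \infty}$. Then for every $\varepsilon, \delta > 0$ there exists $n_{\varepsilon,\delta} \in \mathbb{N}$ such that for $n \ge n_{\varepsilon,\delta}$ there is a
subspace $\mathcal{K}_n(\varepsilon,\delta)$ of $\mathcal{H}^{\otimes n}$ such that

(i) $\log \dim \mathcal{K}_n(\varepsilon,\delta) \le n(h + \delta)$ and

(ii) for every extremal decomposition $D_n = \sum_{i=1}^{m} p_i D^{(i)}$ one can find an encoding $D^{(i)} \mapsto \tilde D^{(i)}$
with density matrices supported in $\mathcal{K}_n(\varepsilon,\delta)$ such that the fidelity $F := \sum_{i=1}^{m} p_i \mathrm{Tr}\, D^{(i)} \tilde D^{(i)}$
exceeds $1 - \varepsilon$.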

The negative part of the coding theorem tells that the source coding rate cannot exceed 
the mean entropy when the fidelity is good. 

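Theorem 2. Let $\mathcal{H}$ be a finite dimensional Hilbert space, and $\varphi$ be a completely ergodic state
on $B(\mathcal{H})^{\otimes \infty}$. Then for every $\delta > 0$ there exist $0 < \eta < 1$ and $n_\delta \in \mathbb{N}$ such that for $n \ge n_\delta$

(i) for all subspaces $\mathcal{K}_n$ of $\mathcal{H}^{\otimes n}$ with the property $\log \dim \mathcal{K}_n \le n(h - \delta)$ and

(ii) for every decomposition $D_n = \sum_{i=1}^{m} p_i D^{(i)}$ and for every encoding $D^{(i)} \mapsto \tilde D^{(i)}$ with
density matrices supported in $\mathcal{K}_n$, the fidelity $F := \sum_{i=1}^{m} p_i \mathrm{Tr}\, D^{(i)} \tilde D^{(i)}$ is smaller
than $\eta$.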

The detailed proofs are given in the next section of the paper. Now we make some comments
on the fidelity $F$. It is possible that $F < 1$ although $\tilde D^{(i)} = D^{(i)}$. This fact might suggest
using another concept of fidelity. Since $D^{1/2} \ge D$ holds for a density matrix, we have

$$\mathrm{Tr}\, D_1^{1/2} D_2^{1/2} = \mathrm{Tr}\, D_1^{1/4} D_2^{1/2} D_1^{1/4} \ge \mathrm{Tr}\, D_1^{1/4} D_2 D_1^{1/4} = \mathrm{Tr}\, D_2^{1/2} D_1^{1/2} D_2^{1/2} \ge \mathrm{Tr}\, D_2^{1/2} D_1 D_2^{1/2} = \mathrm{Tr}\, D_1 D_2 .$$

This implies that

$$F' := \sum_i p_i \mathrm{Tr}\, [D^{(i)}]^{1/2} [\tilde D^{(i)}]^{1/2} \ge F .$$

Both our positive and negative source coding theorems hold if $F$ is replaced by $F'$. (In case
of Theorem 1 this follows from the inequality $F' \ge F$ and in the proof of Theorem 2 we will
show $F' \le \eta$.)
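
The inequality $\mathrm{Tr}\, D_1^{1/2} D_2^{1/2} \ge \mathrm{Tr}\, D_1 D_2$ behind $F' \ge F$ can be spot-checked numerically. The sketch below uses randomly generated density matrices; the helpers `psd_sqrt` and `random_density`, the dimension and the seed are illustrative assumptions.

```python
import numpy as np

def psd_sqrt(matrix):
    """Square root of a positive semidefinite Hermitian matrix via its spectral decomposition."""
    w, v = np.linalg.eigh(matrix)
    return (v * np.sqrt(np.clip(w, 0.0, None))) @ v.conj().T

def random_density(dim, rng):
    """A random density matrix G G^* / Tr(G G^*) (hypothetical test data)."""
    g = rng.normal(size=(dim, dim)) + 1j * rng.normal(size=(dim, dim))
    d = g @ g.conj().T
    return d / np.trace(d).real

rng = np.random.default_rng(1)
d1, d2 = random_density(4, rng), random_density(4, rng)

overlap = np.trace(d1 @ d2).real                          # Tr D1 D2, the term entering F
sqrt_overlap = np.trace(psd_sqrt(d1) @ psd_sqrt(d2)).real  # Tr D1^{1/2} D2^{1/2}, the term entering F'
self_overlap = np.trace(psd_sqrt(d1) @ psd_sqrt(d1)).real  # Tr D1 = 1 when the code equals the source
purity = np.trace(d1 @ d1).real                            # Tr D1^2 < 1 for a mixed state
```

The last two quantities show the motivation for $F'$: when the coding density equals a mixed source density, the $F'$ term equals 1 while the $F$ term is the purity, which is strictly below 1.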



IV. High probability subspace 

The proof of Shannon's original source coding theorem is based on the typical sequences
([12], Chap. 1). The quantum extension of this result obtained by Schumacher still benefits
from the classical result. When the channel state is a product, the densities $D_n$ commute and
simultaneous diagonalization is possible. If memory effects are present, then these densities
do not commute and in some sense we are in a really quantum mechanical non-commutative
situation. Nevertheless, the high probability subspace can be used but new techniques are
required.

Let $\mathcal{K}$ be a Hilbert space and $D$ be a density matrix on $\mathcal{K}$. $D$ has a Schatten decomposition
$D = \sum_i \lambda_i |f_i\rangle\langle f_i|$, where the $|f_i\rangle$ are eigenvectors and the eigenvalues $\lambda_i$ are numbered
decreasingly: $\lambda_1 \ge \lambda_2 \ge \dots$. Choose and fix $0 < \varepsilon < 1$. Let $n(\varepsilon)$ be the smallest integer such
that

$$\sum_{i=1}^{n(\varepsilon)} \lambda_i \ge 1 - \varepsilon .$$

The subspace $HP(D,\varepsilon)$ spanned by the eigenvectors $|f_1\rangle, \dots, |f_{n(\varepsilon)}\rangle$ is called the high probability
subspace corresponding to the level $\varepsilon$. Note that $HP(D,\varepsilon)$ is not completely well-defined:
if there are multiplicities in the spectrum of $D$, then the Schatten decomposition is
not unique. However, the dimension $n(\varepsilon)$ of $HP(D,\varepsilon)$ is determined. The term "high probability
subspace" is borrowed from the monograph [13] and its role in macroscopic uniformity
was discussed in [14].
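
The definition of the high probability subspace translates directly into a short computation. The following is a minimal sketch, assuming a density matrix given as a NumPy array; the function name `high_probability_subspace`, the small numerical tolerance and the sample density are illustrative choices.

```python
import numpy as np

def high_probability_subspace(density, eps):
    """Return (n_eps, basis) for the level eps.

    n_eps is the smallest number of leading eigenvalues whose sum is at least 1 - eps,
    and the columns of basis are the corresponding eigenvectors.
    """
    eigenvalues, eigenvectors = np.linalg.eigh(density)
    order = np.argsort(eigenvalues)[::-1]              # number the eigenvalues decreasingly
    eigenvalues, eigenvectors = eigenvalues[order], eigenvectors[:, order]
    cumulative = np.cumsum(eigenvalues)
    # first index whose cumulative weight reaches 1 - eps (tolerance guards rounding)
    index = int(np.searchsorted(cumulative, 1.0 - eps - 1e-12))
    n_eps = min(index + 1, len(eigenvalues))
    return n_eps, eigenvectors[:, :n_eps]

# Hypothetical density with spectrum 0.5, 0.3, 0.15, 0.05.
density = np.diag([0.5, 0.3, 0.15, 0.05])
n_02, basis_02 = high_probability_subspace(density, 0.2)
n_01, basis_01 = high_probability_subspace(density, 0.1)
projection = basis_01 @ basis_01.conj().T
weight = np.trace(density @ projection).real
```

For this spectrum two eigenvectors suffice at level 0.2 and three are needed at level 0.1. When eigenvalues are degenerate, the returned basis is one of several valid choices, in line with the remark on non-uniqueness above.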



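In the following, $\varphi$ will be a completely ergodic state on $\mathcal{A}^{\otimes \infty}$. For $\varepsilon \in (0,1)$ let

$$\beta_{\varepsilon,n} := \inf\{\log \mathrm{Tr}_n(q) : q \in \mathcal{P}(\mathcal{A}^{\otimes n}),\ \varphi_n(q) \ge 1 - \varepsilon\},$$

where $\mathcal{P}(\mathcal{A}^{\otimes n})$ denotes the set of projections of $\mathcal{A}^{\otimes n}$. ($\exp \beta_{\varepsilon,n}$ is the dimension of the high
probability subspace.) It was shown in [?] (and formulated in terms of relative entropy) that

$$\limsup_{n \to +\infty} \frac{1}{n} \beta_{\varepsilon,n} \le h, \tag{1}$$

$$\liminf_{n \to +\infty} \frac{1}{n} \beta_{\varepsilon,n} \ge \frac{1}{1-\varepsilon}\, h - \frac{\varepsilon}{1-\varepsilon} \log d. \tag{2}$$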

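From this one can deduce the following

Proposition. For every positive $\delta$

(i) and for every positive $\varepsilon$ there exists $N_{\varepsilon,\delta} \in \mathbb{N}$ such that for every $n \ge N_{\varepsilon,\delta}$ there exists a
projection $q_n(\varepsilon,\delta)$ in $\mathcal{A}^{\otimes n}$ such that

$$\log \mathrm{Tr}_n(q_n(\varepsilon,\delta)) \le n(h + \delta) \quad \text{and} \quad \varphi_n(q_n(\varepsilon,\delta)) \ge 1 - \varepsilon,$$

(ii) there exist $1 > \eta > 0$ and $N_\delta \in \mathbb{N}$ such that for every $n \ge N_\delta$ and for every projection
$q$ in $\mathcal{A}^{\otimes n}$,

$$\log \mathrm{Tr}_n(q) \le n(h - \delta)$$

implies $\varphi_n(q) \le \eta$.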

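Part (i) of the Proposition is a plain reformulation of (1). In order to see (ii) we first note
that

$$\frac{1}{\eta}\, h - \frac{1-\eta}{\eta} \log d \to h \quad \text{as } \eta \to 1 .$$

Hence given $\delta > 0$ we choose $0 < \eta < 1$ such that

$$\frac{1}{\eta}\, h - \frac{1-\eta}{\eta} \log d > h - \delta .$$

Next we replace $\varepsilon$ by $1 - \eta$ in (2):

$$\liminf_{n \to +\infty} \frac{1}{n} \inf\{\log \mathrm{Tr}_n(q) : q \in \mathcal{P}(\mathcal{A}^{\otimes n}),\ \varphi_n(q) \ge \eta\} \ge \frac{1}{\eta}\, h - \frac{1-\eta}{\eta} \log d > h - \delta. \tag{3}$$

Thus for large $n$ every projection $q$ with $\varphi_n(q) \ge \eta$ has $\log \mathrm{Tr}_n(q) > n(h - \delta)$.
In this way we arrived at (ii).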

Next we prove the source coding theorem. 

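Proof of Theorem 1. Use part (i) of the Proposition and set $q_n := q_n(\varepsilon/2, \delta)$, $\mathcal{K}_n(\varepsilon,\delta) :=
\mathrm{Ran}\, q_n$, where $n \ge n(\varepsilon,\delta) := N_{\varepsilon/2,\delta}$. Given an extremal decomposition $D_n = \sum_{i=1}^{m} p_i D^{(i)}$, that
is $D^{(i)} = |x_i\rangle\langle x_i|$ for some vectors $x_i$, we construct the coding densities $\tilde D^{(i)}$. Let

$$\bar x_i := \frac{q_n x_i}{\|q_n x_i\|}, \qquad \alpha_i := \|q_n x_i\|, \qquad \beta_i := \|(I - q_n) x_i\|$$

and let $x$ be any unit vector such that $q_n x = x$. Then we set

$$\tilde D^{(i)} := \alpha_i^2 |\bar x_i\rangle\langle \bar x_i| + \beta_i^2 |x\rangle\langle x| .$$

Since $\bar x_i, x \in \mathcal{K}_n(\varepsilon,\delta)$, we have $\mathrm{supp}\, \tilde D^{(i)} \subset \mathcal{K}_n(\varepsilon,\delta)$. Furthermore,

$$\mathrm{Tr}\, D^{(i)} \tilde D^{(i)} = \langle x_i, \tilde D^{(i)} x_i\rangle = \alpha_i^2 |\langle x_i | \bar x_i\rangle|^2 + \beta_i^2 |\langle x_i, x\rangle|^2 \ge \alpha_i^2 |\langle x_i | \bar x_i\rangle|^2 = \alpha_i^4 \ge 2\alpha_i^2 - 1 = 2\, \mathrm{Tr}\, q_n D^{(i)} - 1 .$$

We need to sum over $i$:

$$\sum_i p_i \mathrm{Tr}\, D^{(i)} \tilde D^{(i)} \ge \sum_i p_i (2\, \mathrm{Tr}\, q_n D^{(i)} - 1) = 2\, \mathrm{Tr}\, D_n q_n - 1 = 2\varphi_n(q_n) - 1 \ge 1 - \varepsilon .$$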
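
The coding used in the proof can be tried out on a small random ensemble. The sketch below is an illustration under assumptions: the ensemble of random pure states, the dimension, the rank of the projection and the helper name `encode` are hypothetical, and the projection is simply taken onto the leading eigenvectors of the average density rather than coming from the Proposition. It checks the estimate $F \ge 2\varphi_n(q_n) - 1$.

```python
import numpy as np

def encode(x, q, filler):
    """Coding density alpha^2 |xbar><xbar| + beta^2 |filler><filler| for the pure state |x><x|.

    q is a projection and filler is a unit vector in the range of q.
    """
    qx = q @ x
    alpha_sq = float(np.vdot(qx, qx).real)           # alpha^2 = ||q x||^2
    beta_sq = 1.0 - alpha_sq                        # beta^2 = ||(I - q) x||^2
    code = beta_sq * np.outer(filler, filler.conj())
    if alpha_sq > 1e-15:
        code = code + np.outer(qx, qx.conj())       # alpha^2 |xbar><xbar| equals |qx><qx|
    return code

rng = np.random.default_rng(0)
dim, m, k = 8, 5, 3                                  # hypothetical sizes
vectors = rng.normal(size=(m, dim)) + 1j * rng.normal(size=(m, dim))
vectors /= np.linalg.norm(vectors, axis=1, keepdims=True)
probs = np.full(m, 1.0 / m)
density = sum(p * np.outer(x, x.conj()) for p, x in zip(probs, vectors))

_, eigenvectors = np.linalg.eigh(density)
top = eigenvectors[:, -k:]                           # eigenvectors of the k largest eigenvalues
q = top @ top.conj().T                               # projection onto their span
filler = top[:, -1]

codes = [encode(x, q, filler) for x in vectors]
fidelity = sum(p * np.vdot(x, c @ x).real for p, x, c in zip(probs, vectors, codes))
weight = np.trace(density @ q).real                  # plays the role of phi_n(q_n)
```

Writing $\alpha_i^2 |\bar x_i\rangle\langle\bar x_i|$ as $|q_n x_i\rangle\langle q_n x_i|$ avoids a division when $q_n x_i$ is very small; each coding density has unit trace and is supported in the range of `q`.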



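Proof of Theorem 2. For the given $\delta$ we choose $\eta$ and $n(\delta)$ according to the Proposition.
Let $q$ be the projection onto the subspace $\mathcal{K}_n$. We want to use the Schwarz inequality in the
form

$$\Big| \sum_i p_i \mathrm{Tr}\, x_i y_i \Big| \le \Big( \sum_i p_i \mathrm{Tr}\, x_i x_i^* \Big)^{1/2} \Big( \sum_i p_i \mathrm{Tr}\, y_i^* y_i \Big)^{1/2}$$

for $x_i = [D^{(i)}]^{1/2} q$ and $y_i = [\tilde D^{(i)}]^{1/2}$. Since $[\tilde D^{(i)}]^{1/2} = q [\tilde D^{(i)}]^{1/2}$ follows from the hypothesis,
we have

$$F' = \sum_{i=1}^{m} p_i \mathrm{Tr}\, [D^{(i)}]^{1/2} [\tilde D^{(i)}]^{1/2} = \sum_{i=1}^{m} p_i \mathrm{Tr}\, [D^{(i)}]^{1/2} q [\tilde D^{(i)}]^{1/2} \le \Big( \sum_{i=1}^{m} p_i \mathrm{Tr}\, D^{(i)} q \Big)^{1/2} \Big( \sum_{i=1}^{m} p_i \mathrm{Tr}\, \tilde D^{(i)} \Big)^{1/2} = \varphi_n(q)^{1/2} \le \sqrt{\eta} .$$

Since $\sqrt{\eta} < 1$, this estimate completes the proof.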
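
The estimate $F' \le \varphi_n(q)^{1/2}$ used in this proof holds for any coding supported in the range of $q$, and it can be spot-checked numerically. In the sketch below the source densities, the subspace and the coding densities are all random and hypothetical; `psd_sqrt` and `random_density` are illustrative helpers.

```python
import numpy as np

def psd_sqrt(matrix):
    """Square root of a positive semidefinite Hermitian matrix."""
    w, v = np.linalg.eigh(matrix)
    return (v * np.sqrt(np.clip(w, 0.0, None))) @ v.conj().T

def random_density(dim, rng):
    """A random density matrix (hypothetical test data)."""
    g = rng.normal(size=(dim, dim)) + 1j * rng.normal(size=(dim, dim))
    d = g @ g.conj().T
    return d / np.trace(d).real

rng = np.random.default_rng(2)
dim, k, m = 6, 2, 4                                   # hypothetical sizes
probs = np.full(m, 1.0 / m)
sources = [random_density(dim, rng) for _ in range(m)]

# Orthonormal basis of a random k-dimensional subspace and its projection q.
basis, _ = np.linalg.qr(rng.normal(size=(dim, k)) + 1j * rng.normal(size=(dim, k)))
q = basis @ basis.conj().T

# Arbitrary coding densities supported in the range of q.
codes = [basis @ random_density(k, rng) @ basis.conj().T for _ in range(m)]

average = sum(p * d for p, d in zip(probs, sources))
f_prime = sum(p * np.trace(psd_sqrt(d) @ psd_sqrt(c)).real
              for p, d, c in zip(probs, sources, codes))
bound = np.sqrt(np.trace(average @ q).real)           # phi_n(q)^{1/2}
```

Whatever coding is chosen inside the subspace, `f_prime` stays below `bound`, so a subspace of small weight forces a small fidelity.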

It is known that for strongly mixing algebraic states [?] and for ergodic Gibbs states [14]

$$\lim_{n \to +\infty} \frac{1}{n} \beta_{\varepsilon,n} = h$$

and in this case the negative part of the coding theorem holds in a stronger form: For every
$\varepsilon, \delta > 0$ there exists $n_{\varepsilon,\delta} \in \mathbb{N}$ such that for $n \ge n_{\varepsilon,\delta}$, for all subspaces $\mathcal{K}_n$ of $\mathcal{H}^{\otimes n}$ with the
property $\log \dim \mathcal{K}_n \le n(h - \delta)$, for every decomposition $D_n = \sum_{i=1}^{m} p_i D^{(i)}$ and for
every encoding $D^{(i)} \mapsto \tilde D^{(i)}$ with density matrices supported in $\mathcal{K}_n$, the fidelity
$F := \sum_{i=1}^{m} p_i \mathrm{Tr}\, D^{(i)} \tilde D^{(i)}$ is smaller than $\varepsilon$.

There is a seemingly slight difference between the two theorems. The statistical operator
$D_n$ has an extremal decomposition in the first one and an arbitrary decomposition in the second.
The difference between the pure and mixed message ensemble is discussed in the recent paper
[?].


V. Discussion 

In this paper a theory of quantum source coding subject to a fidelity criterion or quantum 
data compression is presented. The minimum of the source coding rate is studied under the 
conditions that Schumacher's fidelity must exceed 1 — e and the quantum mechanical state 
of the channel has a strong ergodic property. This latter condition allows many states with 
memory effect. For the mathematical model and in the proof of the main result techniques 
of quantum statistical mechanics are used. We prove that the minimal source coding rate is 
the mean entropy of the channel state, and, to some extent, it is independent of the message 
ensemble. 